NON-BRANCHING GEODESICS AND OPTIMAL MAPS 
IN STRONG CD(if,oo)-SPACES 
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Abstract. We prove that in metric measure spaces where the entropy functional is K- 
convex along every Wasserstein geodesic any optimal transport between two absolutely con- 
tinuous measures with finite second moments lives on a non-branching set of geodesies. As a 
corollary we obtain that in these spaces there exists only one optimal transport plan between 
any two absolutely continuous measures with finite second moments and this plan is given 
by a map. 

The results are applicable in metric measure spaces having Riemannian Ricci-curvature 
bounded below, and in particular they hold also for Gromov-Hausdorff limits of Riemannian 
manifolds with Ricci-curvature bounded from below by some constant. 



1. Introduction 

Ricci-curvature lower-bounds in general metric measure spaces were studied by the second 
author in |13t [T5] and at the same time with a similar approach by Lott and Villani in [S] . 
There the lower-bound K € R on the Ricci curvature without a reference to the dimension 
of the metric measure space was defined as if-convexity of the entropy functional along 
Wasserstein geodesies (see Section [2] for details). These spaces are called (weak) CD{K,o6)- 
spaces. The word weak is sometimes used to emphasize that the convexity is required only 
along one geodesic between any two given probability measures. 

A property of the CD(K, oo)-spaces which complicates the theory is the possibility to have 
branching geodesies. For example the space M? with the Zoo-norm is a CD(0, oo)-space and 
it has lots of branching geodesies. Being the limit as p — > oo (in any reasonable sense) of 
the spaces M 2 with the Z p -norm - which are non-branching CD(K, oo)-spaces - this example 
in particular illustrates that being non-branching is not a stable property. A number of 
results in CD(K, oo)-spaces have been proven only under the extra assumption that there 
are no branching geodesies in the space. Although in some of the results this assumption has 
recently been removed (see for example \12\ lllj ) in many it still remains. 

Because branching geodesies are hard to deal with, it is reasonable to consider more restric- 
tive definitions that exclude spaces with branching geodesies or at least limit the amount of 
branching that can occur. One of the essential properties of the definitions of Ricci-curvature 
lower-bounds in metric measure spaces is the stability under the measured Gromov-Hausdorff 
convergence. Therefore any stable definition that extends the Riemannian case should include 
the limit spaces of Riemannian manifolds with uniform Ricci-curvature lower-bounds. While 
Riemannian manifolds are known to be non-branching, to our knowledge it is not known 
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whether this holds for their limit spaces. The spaces we consider in this note cover also these 
limit spaces. Although our result does not rule out the possibility to have some branching 
geodesies, it still says that there has to be so few branching geodesies that optimal transports 
between any two absolutely continuous measures do not see them. Let us now state our main 
result. We refer to Section [2] for the notation we use here. 

Theorem 1.1. Let (X, d) be a complete separable metric measure space with a locally finite 
measure m. Let 7v S OptGeo(/io, Mi) with Mo>Mi £ &2{X) absolutely continuous. Suppose 
that for every i, s € [0, 1] and any Borel function f : Geo(X) — > [0, oo) with f Gco ( X ) Z^ 71 " = 1 
the entropy Ent m is K-convex along (restr*)^t(/7r). Then it is concentrated on a set of non- 
branching geodesies. 

As an immediate corollary of Theorem ll.il we have the following result. 

Corollary 1.2. Let (X, d) be a complete separable metric measure space with a locally finite 
measure m where Ent m is K-convex along every n G OptGeo(M(b Mi) f or every MOjMi £ 
&2{X). Then for every Mo>Mi S S^iiX) which are absolutely continuous with respect to m 
we have that any n £ OptGeo(Mo ; Mi) * s concentrated on a set of non-branching geodesies. 

One of the cases which is covered by Theorem 11.11 and Corollary 11.21 are the RCD(K, oo)- 
spaces, that is, spaces with Riemannian Ricci-curvature bounded from below by some constant 
K 6 R. These spaces were recently defined and studied in [2j |3], see also pp. They were 
obtained by reinforcing the CD (K, oo)-spaces with a requirement that the local structure of 
the space must be Hilbertian. The Hilbertian structure immediately rules out spaces like 
the above mentioned M 2 with the Zoo-norm. In [3] it was shown that one of the equivalent 
formulations of RCD(K, oo)-spaces is that any probability measure with finite second moment 
is the starting point of an EVI^-gradient flow of the entropy. This condition is known to 
imply iC-convexity of the entropy along every geodesic [7] . 

In [3] it was also proven that the definition of RCD(K, oo) is stable under the measured 
Gromov-Hausdorff convergence (or under the D-convergence introduced in p3]). When we 
combine this result with Theorem 11.11 we arrive at the following corollary which in fact was 
our main motivation to write this note. 

Corollary 1.3. The RCD(K, oo)- spaces provide a stable abstract notion of non-branching 
spaces with Ricci-curvature bounded from below. 

Here the amount of non-branching is measured as before: Let (X, d,m) be an RCD(K, oo)- 
space in the sense o/[3] or pp. Then for every MOjMi G &2{X) which are absolutely continuous 
with respect to m we have that any it E OptGeo(MO) Mi) ^ s concentrated on a set of non- 
branching geodesies. 

As we already mentioned, we are not aware of any other non-branching results even for the 
Gromov-Hausdorff limits of Riemannian manifolds with Ricci-curvature bounded from below 
by some constant. 

We also note that the conclusion of Corollary 11.21 can be stated in a stronger looking 
equivalent form. Namely, the measure 7r lives on a set of geodesies where no two distinct 
geodesies meet at some time t G (0,1). To see this, consider a set of geodesies where there 
is such crossing. By mixing the geodesies before and after the crossing we produce a set of 
branching geodesies without affecting the initial and final measures. A contradiction follows 
then via Theorem ll.il 
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By inspecting the proof of [HJ Theorem 3.3] where the existence of optimal maps in non- 
branching CD{K, oo)-spaces was shown we can make another refinement of the statement of 
Corollary 11.21 Regarding this refinement, stated in Corollary 11.44 we note that in [8] there 
was an extra assumption for the measures /j,q and \i\ to have finite entropy. This assumption 
was needed for showing that all absolutely continuous measures with respect to a measure 
7r G OptGeo(/xo, fJ-i) satisfying (|2.ip also satisfy (|2.ip . Here this conclusion is already as an 
assumption so finiteness of the initial and final entropies are not needed. 

Corollary 1.4. Let (X, d) be a complete separable geodesic metric measure space with a 
locally finite measure m where Ent m is K-convex along every 7v G OptGeo(/io, A*i) for every 
^0,^1 G 2?i{X). Then for every Ho,^i S &2(X) which are absolutely continuous with respect 
to m there is a unique tt G OptGeo(/io, A*i) an d ^ ^ s induced by a map. 

In the classical situation of Euclidean spaces the existence of optimal transport maps was 
proven by Brenier in [6]. Since then there have been several generalizations of this result. Most 
relevant in the context of this note are the generalizations of McCann [10] for Riemannian 
manifolds, of Bertrand [5] for Alexandrov spaces, and the most recent resutls of Gigli [8] in 
non-branching CD(K, N) and CD(K, oo)-spaces and of Ambrosio and the first author [3] in 
strongly non-branching metric spaces. Notice that as in the approach by Gigli [8] our proof 
for the existence of optimal transport maps does not use Kantorovich potentials. 



We work in a complete separable metric space (X, d) equipped with a locally finite Borel 
measure m. We write the set of all the Borel probability measures on X as £P(X). We 
denote by S?2{X) the subset of &(X) consisting of probability measures with finite second 
moments. We equip the space &2{X) with the Wasserstein 2-distance which for any two 
measures po, pi G £?2{X) is defined as 



where the infimum is taken over all a G 3 > 2{X x X) with fiQ = (p 1 )^ and [i\ = (p 2 )#<7. 
Here, and later on, p fc denotes the projection to the /c:th coordinate. 

All the geodesies we consider in this note are constant speed geodesies parametrized by 
the unit interval [0,1]. We denote the set of all such geodesies in X by Geo(X), and for 
7 G Geo(X) and t G [0, 1] we use the abbreviation ~f t = 7(*)- The distance between 7 X ,7 2 G 
Geo(X) is given by max{d(7j, 7 2 ) : < t < 1}. We define for all s,t G [0, 1] the restriction 
map restr* : Geo(X) -> Geo(X) : 7 i-> 7 o /* with /* : [0, 1] ->■ [0, 1] : x i->- (s - t)x + t. 

We call a set T C Geo(X) non-branching if for any 7,7 G T we have: if there exists 
to £ (0, 1] such that 7 S = 7 S for all s G [0, to], then 7 = 7. A measure 7r G ^ g (Geo(X)) is said 
to be concentrated on a set of non-branching geodesies, if there exists a non-branching Borel 
set r C Geo(X) so that 7r(T) = 1. The space consisting of all measures 7r G &(Geo(X)) for 
which the mapping t (->• (e 4 )^7r is a geodesic in &(X) from fio = (eo)#7r to fj,\ = (ei)#7r is 
denoted by OptGeo(/io, /Ui). Here the evaluation map is defined as et : Geo(A") — > X : 7 i->- 74. 

The Ricci-curvature is defined using the entropy functional Ent m which for an absolutely 
continuous measure \i = pm G &{X) is defined as 



2. Preliminaries 
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For measures that have a singular part we define Ent m (//) = oo. 

Following the definition in [13] we call a metric measure space (X, d, m) a (weak) CD(K, oo)- 
space, with some K G R, provided that for any ^Oi/^i £ <^2p0 there exists a geodesic 
(Ht) G Geo(<^2(^)) along which Ent m is i^T-convex, that is 

Ent m (/x t ) < (1 - i)Ent m (/x ) + tEnt m ( W ) - —t{l - t)W$(fi , Ml) (2-1) 

holds for all t € [0, 1]. We say that a functional is if-convex along 7r G OptGeo(/Uo, /^i) if it is 
if -convex along the corresponding geodesic (et)#7r. If the inequality (|2.ip is required to hold 
for all geodesies (//t) G Geo(^2(^0)> the space is sometimes called a strong CD(K, oo)-space. 
However, some authors use the term strong to refer to a slightly weaker notion. 

3. Proof of Theorem 11.11 

The idea of the proof is similar to the proof of [12\ Theorem 4]. We prove the claim 
by contradiction. First we find two geodesies in the Wasserstein space which start as the 
same geodesic and then branch out to two completely disjoint ones. By comparing the two 
geodesies separately and on the other hand their sum as one geodesic, we notice that there 
will be an extra drop of log 2 in the entropy during the time interval when the branching 
occurs. In order to arrive at a contradiction this branching has to happen in a small enough 
time interval. 

Let us start the proof with a simple lemma. It allows us to select the two disjoint geodesies 
needed for the contradiction. The geodesies will be selected using a probability measure on 
the product space Geo(X) x Geo(X). Notice that the constant 1/5 in the lemma is not sharp, 
however it is sufficient for our use. 

Lemma 3.1. Let (X, d) be a separable metric space. Then for any a G &{X x X) for which 
cr({(x,x) : x G X}) = there exists E C X so that cr(E X (X \ E)) > 1/5. 

Proof. We will first reduce the general case to the case where X is a finite set. Take e > 0. 
Because the diagonal {(x,x) : x G X} has zero cr-measure, there exists 5 > so that 

tr({(x,y) EXxX : d(x,y) < 6}) < e. (3.1) 

Partition the space X into a countable collection of Borel sets (Qi)i with diameter at most 
5/2. Now there exists some n G N so that 

n 

E ((P fc )# CT ) (Qi) > 1 - e ' for* = 1,2. (3.2) 

i=l 

Therefore, by combining (|3.ip and (|3.2p we have 

o- |J Qi x Qj > 1 - 3e 

\l<i,j<n,i^j J 

and so by forgetting a part of the space with arbitrarily small measure we may assume the 
space X to consist of n points. 

The existence of the set E in the case where X consists of n points follows easily: There 
are a total number of 2 n — 2 ways to select a non-empty set E C X and for any pair of points 
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with i ^ j there are 2 n 2 sets E C X with i £ E and j £ E. Since 



^<r(£x(X\£))=2 



EcX 

there has to be a set E <Z X with 



CT (£x(X\£))> J^>I. 



Taking e > sufficiently small finishes the proof. 



□ 



Now we are ready to continue with the proof of Theorem 11.11 The first reduction steps 
contain ideas which were used in the proofs of [5J Theorem 3.3] and |121 Theorem 4]. The 
rest of the proof is then close to that of [121 Theorem 4] . 

Proof of Theorem \l.l[ Assume that the claim is not true and let 



Because the problem is local in nature, we may assume the measure 7r to be concentrated 
on geodesies living inside some ball B(x,l/2) C X so that in particular the geodesies have 
lengths bounded from above by I. 

From the assumption that n is not concentrated on a non-branching set of geodesies we 
know that there exists some T E (0, 1) so that the measure 7r 7 is not a Dirac mass for a 
(restrj )#7r-positive set of curves 7 € Geo(A), where {7r 7 } C &(Geo(X)) is the disintegration 
of tv with respect to restrj . 

Consider a measure a € ^(Geo(X) x Geo(X)) obtained by integrating up the disintegrated 
measure 7r 7 as a product measure 7r 7 x 7t 7 . That is, for any Borel measurable /: Geo(X) x 
Geo(X) — > [0, 00] we have 



Restrict <x to this set and to simplify the notation still denote by <x this new measure, which 
we renormalize to be a probability measure. 

Furthermore, there exists a 5 € (0, (1 — S)/2) so that 

a({(7 1 ,7 2 ) : there exists t € [T,S] such that restrQ^ 1 ) = restr}^ 2 ) 

and 7g / 7 2 for all s £ (t,t + 5)}) > 0. 

Again we restrict a to this set and renormalize it to be a probability measure. Without loss 
of generality we may assume that there exists a constant C > so that 




(3.3) 



J Geo(X) J (restrj) - 1 ( 7 ) x (restrj ) - 1 ( 7 ) 

There exists some S £ (T, 1) so that 




o-({( 7 1 ,7 2 ) e Geo(X) x Geo(X) : restrg( 7 1 ) ^ restrg( 7 2 )}) > 0. 



d(e )#( P V) d(e 1 )#(p 1 < r) d(e 1 )#(p 2 a) 



< a 



dm ' dm dm 
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Define the function /: [0, 1] — > [0, 1] as 

f{t) = er({(7 1 ,7 2 ) : restr^ 1 = restr^ 2 }). 

Notice that / is decreasing, / = 1 on [0,T] and / = on [S, 1]. Roughly speaking f(t) gives 
the amount of branching that will occur after time t. 

Take a small e G (0, 5) and let t G [T, 5] be such that f(t) — f(t + e) > e. Define 

F = {(T 1 )! 2 ) : restro7 X = restrj^ 2 and restr^^ 1 ^ restrg +e 7 2 }. 

Now cr(F) > e by the choice of t. By Lemma 13, II there exists E C Geo(X) so that 



<t((Ex (Geo(X) \ £)) n f) > > |. 



Let r] > be such that 



<t((£ x (Geo(X) \ £)) nF 



M := a(A) > -± '- > —, (3.4) 

whereA={( 1 \ 1 2 )£(Ex(Geo(X)\E))nF : d( 7t 1 +e , 7t 2 +J > /?}• 

Now define er = <x|^ and using it 7r = (p^^cr. Write p s m = (e s )#7r for all s G [0, 1]. By 
Jensen's inequality and f|3.4[) we get 



/ Pt+e log dm > M log — - > M log t-^ v • (3.5) 
J m(B(x,l/2)) Wm(B(x,l/2)) 

Using (|3.5p we find a set Q C X with diam(Q) < rj so that 

to := tt(G) > 

for G = {7 G Geo(X) : 7 t+e G Q} and 

I ^ log ^ dm > ^log 1Qm(i , ( e , J/2)r (3-6) 

where 7r up = 7ri = (p 1 )#cri and /i" P = /Qs P m = (e s )#7r up for all s G [0,1]. Define 

also 7r down = (p 2 )#o-| GxGeo(x) and ^ own = p down m = (e s ) # 7r down for all s G [0,1]. Notice 

that (restr*) # 7r up = (restr* ) # 7r down and p^_ e 1 pf^ n . 

Since we have not done any rescaling after defining the final <x we notice that the estimate 



/ 



p log p dm < w log C (3.7) 



holds for all p G {p^ 11 = p^ p , pf own , p^ p }. 

Now, similarly as in [12] we can estimate using the if-convexity first along the measure 
(7r up + 7r down )/(2u;) between times 0, t and t + e, and then separately along iv up /w and 
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7r down /to between times t, t + e and 1 to get 

„down „down 

El \ og Ei dm 

w w 

c r „down .down j. r «down i up down j_ up i 7^1 

<-f- / ^Llog^dm+^ / Pt+e +ft ± e i ft+e + £+£ d m+ 

e , C t , 

< — — log — — log 2 

r + e to t + e 

+ / r ndown „down /> ,,up up \ \x< r \ 

+ ^— ) /^log^dm+/^±Mog^dm + 

< 77— log — - 7-7- log 2 

c + e w t + e 

t / e /> < r own 1 pf own , l-t-e f pf own , pr own , 

2(i + e) \1 — t J w w 1 — t J w w J 

ft te \ C t , 

< h 7 77 7 log log 2 

(t + e)(l — t) J w w 
Moving the integral term from the right-hand side to the left and using (|3.3p gives 



„down „down / , j. , \ n f 

i og P^ dm <(^ + E! )io g ii-_Liog2 + |i<:|^ 



(t + e)(l-t)J w w ~\t + e (l-t)(t + e)J w t + e 

( e te \ , C 2t , 

< h 7 77 7 log ; 7 log 2 

~\t + e {l-t){t + e)J *w 3(t + e) B 



which is the same as 

.down ..down 



J w w w 3e 

Using convexity along 7r up /-u; once more together with (|3.8p and (|3.3p yields 

i g n±£ dm < log - + / - log ^ dm + — e(l - t - e)l 2 



w w 1 — t w 1 — t J w w 2 

, C tfl-t-e), 

<log ^— Mog2. 

to 3e 

When we combine this with (13.61) we have 



3£ (' OE 10m Wx, 1/2)) " C j < - < - £ ) log 2 < log 2 
which is a contradiction as the left-hand side goes to when e 4. 0. □ 
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